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We introduce a discrete network corresponding to any Gaussian wireless network that is obtained by 

simply quantizing the received signals and restricting the transmitted signals to a finite precision. Since 

signals in the discrete network are obtained from those of a Gaussian network, the Gaussian network 

can be operated on the quantization-based digital interface defined by the discrete network. We prove 

[__i that this digital interface is near-optimal for Gaussian relay networks and the capacities of the Gaussian 

i— i 

and the discrete networks are within a bounded gap of 0(M ) bits, where M is the number of nodes. 

We also prove that any near-optimal coding strategy for the discrete network can be naturally 

transformed into a near-optimal coding strategy for the Gaussian network merely by quantization. We 

exploit this property by designing a linear coding strategy for the case of layered discrete relay networks. 

The linear coding strategy is near-optimal and achieves all rates within 0(M 2 ) bits of the capacity, 
CO 

independent of channel gains or SNR. The linear code is therefore a near-optimal strategy for layered 

Gaussian relay networks and can be used as-is on the Gaussian network after simply quantizing the 

signals. The linear code is also robust and the relays need not know the channel gains on either the 

incoming or the outgoing links. The transmit and receive signals at all relays are simply quantized to 

binary tuples of the same length n, which is all that the nodes need to know. The linear network code 



is a particularly simple scheme and requires all the relay nodes to collect the received binary tuples 
into a long binary vector and apply a linear transformation on the long vector. The resulting binary 
vector is split into smaller binary tuples for transmission by the relays. The quantization requirements 
of the linear network code are completely defined by the parameter n, which therefore also determines 
the resolution of the analog-to-digital and digital-to-analog convenors that are required for operating 
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the network within a bounded gap of the network's capacity. As is evident from the description, the 
linear network code explicitly connects network coding for wireline networks with codes for Gaussian 
networks. 

I. Introduction 

Even in a simple wireless network, there is an infinitude of options and techniques available 
for cooperation among nodes, and it is of great interest to determine provably near-optimal 
operating strategies. This has motivated research into network information theory that aims to 
determine the fundamental limits on the capacity of wireless networks as well as optimal or near- 
optimal strategies for operating them [|2T|. This is generally a difficult problem and a precise 
characterization has remained elusive for most networks. One approach to circumvent this is to 
obtain scaling laws that shed light on capacity as well as operating strategies, as the number of 
nodes in a network increases (H. Another approach, initiated in the pioneering work of [0, is 
to seek answers that are within a bounded gap of the capacity that is independent of the channel 
gains or SNR, and is a function of only the number of nodes in the network. This is done by 
studying an alternative model that is more tractable then the original Gaussian network. Such a 
bounded gap approximation is valid at all SNRs, and is most relevant in the high power or low 
noise regimes. 

In this paper we focus on the latter direction. We address the issues of both approximating 
the capacity of Gaussian relay networks, and rigorously prove the near-optimality of a coding 
strategy to operate them. Our approach employs the strategy espoused in [2] — study not the 
original Gaussian network, but an alternative model. Our goal in this paper is to develop just such 
an alternative model that can serve as a digital interface for operating Gaussian networks in that 
it simultaneously possesses three properties. First, the alternative model must well-approximate 
the capacity of the Gaussian network up to a constant gap; i.e., the capacity of the original 
Gaussian network and the capacity of the alternative model must be within a bounded gap 
that is independent of channel gains or SNR. Second, it must be possible to easily operate 
the Gaussian network on the interface defined by the new model. The first two properties give 
us the capability to convert any near-optimal coding strategy for the alternative model into a 
near-optimal coding strategy for the original Gaussian network. Their usefulness is however 
contingent on actually being able to design a near-optimal strategy for the alternative model. 
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So, the third property is that the alternative model be tractable in that one can actually design 
near-optimal strategies for it. 

In an earlier paper [1J a superposition model was proposed that had the first property. However, 
Gaussian networks cannot be naturally operated on the interface defined by the model, and near- 
optimal coding strategies for the superposition model are not easy to design or describe. The 
truncated deterministic model proposed in satisfies the first property but has drawbacks similar 
to that of the superposition model. The linear deterministic model proposed in [0 has the third 
property since optimal codes can be designed for it, but not the first property since it cannot 
approximate the capacity of the Gaussian network. It satisfies the second property in a limited 
context since a strategy in the linear deterministic network need not get mapped to an identical 
strategy in the Gaussian network. In this paper, we propose a different model, called a discrete 
model, that has all three properties for the class of layered Gaussian relay networks. In fact, for 
this model we show that a linear network code is actually near-optimal. Thus we establish the 
interesting result that scalar quantization followed by linear network coding is near-optimal for 
the class of Gaussian layered relay networks, achieving rates within a bounded gap of capacity. 

The discrete model is naturally derived from the Gaussian model by simply quantizing the 
received signals and by restricting the transmit signals to a finite alphabet. Thus the stochasticity 
of the Gaussian network carries through into the discrete model and we can construct a discrete 
network that corresponds to any Gaussian network. Note that the discrete model is not a noiseless 
deterministic model as in [|2]| or JT). In order to operate the Gaussian network on the discrete 
interface, one simply quantizes the received signals and restricts the transmit signals to a finite 
alphabet. Hence the discrete network provides a quantization-based digital interface for operating 
Gaussian networks. We prove that the discrete network satisfies the above-mentioned three 
properties for the class of layered Gaussian relay networks. The capacities of the layered Gaussian 
relay network and its discrete counterpart are shown to be within a bounded gap of 0(M 2 ) bits, 
where M is the number of nodes in the network. We construct a simple coding scheme for the 
layered discrete relay network, called the linear network code, which achieves all rates within 
0(M 2 ) bits of the capacity of the discrete network, independently of channel gains or SNR. 
This linear network code is also near-optimal for the layered Gaussian network and can be used 
on it by operating the Gaussian network on the discrete interface. The linear network code is 
a generalization of random linear network coding for wireline networks [5], [6], or for linear 
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deterministic networks [2], As explained in detail in the sequel, the linear network code involves 
random encoding at the source followed by a random linear transformation at the relay. The 
crucial distinction between this scheme and the network coding schemes in the literature is that 
we need to account for the noise in the received signals at the relays and destination. To the best 
of our knowledge, this is the first instance of an explicit connection between network coding 
for wireline and noiseless networks with coding for Gaussian networks, with a bounded-gap 
guarantee on performance. 

A. Previous work on relay networks 

Any summary of work on relay networks has to start with the early and lasting contributions 
in H| and Q. Networks with a single relay node were introduced in [3], and the well-known 
decode-and-forward (DF) and compress-and-forward (CF) schemes for such relay networks were 
developed in [4]. DF was extended to networks with multiple relays in 0, ifTOll . An extension 
of CF to multiple relays was presented in [fTTI . It was proved in [2 J that these extensions of DF 
and CF to relay networks achieve rates which can be arbitrarily lower than the network capacity. 

In fl2}, a fundamental question on the tightness of the cut-set upper bound on the capacity of 
a relay network was answered in the affirmative. It was proved that the capacity of the relay 
network is at most 0(M log M) bits below the cut-set bound, where M is the total number of 
nodes in the network, and the bounded gap is independent of channel gains or the SNR. The 
result was established via constructing a new coding scheme called quantize-map-and-forward 
(QMF) in which the relays quantize the received signals, buffer them, and randomly map them 
to a Gaussian codeword. QMF achieves all rates within 0(M log M) bits of the cut-set bound. 

Another important contribution in [2] was the introduction of the linear deterministic model 
as a technique to approximate Gaussian networks and to gain insights into coding schemes 
for Gaussian networks. For every Gaussian network, we can construct a corresponding linear 
deterministic network. The cut-set bound is the capacity of the linear deterministic relay network 
and can be achieved by random linear codes. However it was shown in fl2]|, JTJ that the linear 
deterministic model does not approximate the capacity of Gaussian relay networks and the 
capacity of the Gaussian network can be arbitrarily higher than that of the corresponding 
linear deterministic network. It was proved in [2] that the truncated deterministic model can 
approximate the capacity of Gaussian relay networks within a gap of 0(M log M) bits. The 
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discrete superposition model for networks was introduced in HI as an alternate deterministic 
model for approximating Gaussian networks. It was shown that the capacity of the Gaussian and 
the corresponding discrete superposition networks are within 0(M log M) bits. More importantly, 
it was proved in flU that any coding scheme for the discrete superposition network can be simply 
lifted to the Gaussian network with a loss of at most 0(M log M) bits in the rate. This reduced 
the problem of code design for Gaussian networks into the perhaps simpler problem of code 
design for the noiseless discrete superposition network, though the lifting procedure still had 
exponential complexity. However, the issue of designing a near-optimal code for the noiseless 
discrete superposition network remained elusive. The study of simple coding schemes for the 
discrete superposition network led to the linear network code presented here. 

The bounded gap approximation for capacity of relay networks was further improved in 
|[T2) and an extension of compress-and-forward to relay networks was constructed that was 
approximately optimal. Here the relays are required to perform vector quantization in order to 
compress their received signals. In lfT3l . the minimal compression rates for the relay nodes were 
computed, and the decoding procedure from 02) was further simplified. 

The capacity results for relay networks have spurred research in finding low complexity coding 
schemes. In [14J, the QMF scheme was modified by choosing low-density parity-check codes 
for encoding at the source and the relay instead of Gaussian codes. A simplified decoding 
algorithm based on Tanner graphs was presented and the viability of the proposed technique 
was shown via simulations. In 03), a different approach was taken by constructing codes that 
are computationally tractable when compared to QMF. A concatenated code was presented for 
the relay network where the outer code is a polar code and the inner code is a modification of 
the random Gaussian code from [2J. This approach was shown to have computational complexity 
that is near-linear in the block-length of the code. In [16J, the quantization and encoding in QMF 
is modified by using nested lattice codes at the source and relays. 

B. Network coding 

Network coding was introduced in the landmark paper [5]. The max-flow min-cut theorem 
was established for noiseless wireline networks with a single source and multiple destinations, 
which essentially implied that the cut-set bound was the capacity of these networks. A class of 
codes called /3-codes, which involved random encoding at the source and intermediate nodes, 
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were shown to achieve the capacity of the network with increasing block-length. There were 
other important contributions in 0: namely, the notion of distinguishability of codewords at 
the destination, which naturally introduces the notion of cuts in proving the achievability of the 
cut- set bound, and the technique of time-parameterizing cyclic networks to view them as special 
cases of acyclic networks. Both these techniques were used in the proofs in [[2l and are used 
in the proof of approximate optimality of linear network codes for Gaussian networks in this 
paper. 

Later, in ifTTl and lfT8l . it was established that linear network codes suffice to achieve the 
capacity of wireline networks. In IfTTl . algebraic tools were introduced that simplified the analysis 
and design of network codes. In [6], it was shown that random encoding at the source and random 
linear operations at the relay nodes achieves the capacity of the wireline network in the limit 
of increasing block- length of the codewords. In [2], the applicability of random linear coding 
to linear deterministic networks was shown, where random encoding by the source and random 
linear encoding by the relays achieves the cut-set bound in the limit as the block-length tends to 
infinity. One of the contributions in this paper is to show the applicability and near-optimality 
of random linear coding for layered Gaussian networks. 

C. Outline of the paper 

In Section [TTJ. we describe the network models. We define the Gaussian model for networks 
and also the discrete model, where the latter is obtained by quantizing the received signals in the 
Gaussian model and by restricting the transmit signals to a finite alphabet. Next we prove that the 
cut-set upper bounds on the capacities of the Gaussian and discrete networks differ in a bounded 



gap of 0(M log M) bits, with the bound independent of channel gains or SNR (Section III) 



The linear network code for layered discrete networks is presented in detail in Sec. IV The 
linear network code is shown to be approximately optimal and achieves rates within 0(M 2 ) bits 
of the capacity of the layered discrete network. In Sec. |V} the linear network code is proved 
to be approximately optimal for the layered Gaussian network and achieves all rates within 
a bounded gap of 0(M 2 ) bits from the capacity of the network, with the bound independent 
of channel gains or SNR. This will imply that the capacities of the layered Gaussian network 
and the layered discrete network are within a bounded gap of 0(M 2 ) bits. The linear network 
code is also approximately optimal for layered MIMO networks where the nodes have multiple 
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transmit and receive antennas (Sec. |V-B[), and for multicast networks where the source wants to 



transmit the same information to a subset of the other wireless nodes (Sec. V-C). We conclude 



by summarizing the properties of the linear network code in Sec. VI 



II. Network models 

First we describe the Gaussian model for wireless networks, followed by the discrete network 
that serves as the digital interface for the linear network code. 

A. Gaussian model 

We consider relay networks consisting of a single source and a single destination, which we 
model by the following Gaussian network. The network has a total of M + 1 nodes labeled as 
{0,1,..., M}, with being the source and M the destination. A directed graph describes the 
underlying topology of the network. The wireless link connecting two nodes i and j is described 
by a complex channel gain hij. All the transmit signals {xi} are constrained to be complex 
numbers with a unit power constraint. The received signal at node j is given by 

ieAT(j) 
where N(j) = {i ■ (i,j) G 6} is the set of neighbors of the j-th node, Zj is £/V(0, 1) complex 
white Gaussian noise independent of the transmit signals, and h^ = hijR + ihiji. 

We will also allow for MIMO relay networks where the nodes have multiple transmit and 
receive antennas. Suppose node i has Tj transmit antennas and £/,, receive antennas. In that 
case, the transmitted signal at each time instant at node i is a Tj-dimensional vector of complex 
numbers, and the received vector is LVdimensional. Correspondingly, the channel between two 
nodes i and j is then described by a collection of channel gains {h^j} where k and I index 
respectively the transmit antennas at node i and the receive antennas at node j. The received 
signal at node j is 

y l i= E 5>?K + 4 i = i,-. .,Ui, (2) 

ieMQj) fc=i 
where x\ is the signal transmitted from the /c-th antenna of node i, and y l , is the received signal 
at the /-th antenna of node j. zj is complex Gaussian noise added to the received signal at 
the j-th node. For simplicity of exposition, we restrict a transmit signal to satisfy an individual 
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power constraint, though our results can be extended to allow for a total power constraint across 
all the antennas of a node. 

B. Discrete model 

Now we describe a quantization-based digital interface for operating the above Gaussian 
network, i.e., for the purpose of defining the coding strategy. It is obtained, as the name above 
suggests, by quantizing the continuum valued signals received from the Gaussian network, and by 
restricting the choice of transmitted signals to lie in a discrete set. The quantization of the received 
signals is natural, though we will even discard sign information. We will call the overall network 
resulting from the discrete-inputs and discrete-outputs at each node as the discrete network. 

In the language of automata theory, this discrete network can be simulated (see Ifl9l0 from 
the Gaussian network, i.e., it uses lesser information than the original Gaussian network. Our 
key result consists of three dovetailed parts. The first part is that there is no significant loss 
in the cut-set bound, in that the bit gap between the upper bounds on the capacities of the 
two networks, the Gaussian network and the discrete network, is bounded over all SNRs. The 



second result, described in Section IV, builds on this first key result. The second part is that 
we identify a near capacity achieving coding strategy for the discrete network. The third part 
is that this coding strategy for the discrete network can be easily implemented on the Gaussian 
network, through simple quantization, and achieves rates within a bounded bit gap from the 
capacity. Thereby we obtain both a capacity approximating discrete network, a simple linear 
coding strategy for the discrete network, as well as a natural mapping of the coding strategy 
from the discrete network to the Gaussian network. Together, the combined results establish a 
discrete network for both analyzing the original network, as well as operating it near-optimally 
through simple quantization. It is therefore appropriate to call such a discrete network with these 
two rigorously established properties as a quantization-based digital interface. 

The discrete network is obtained by quantizing the received signals and constraining the trans- 
mit signals in a Gaussian network. The received and transmit signals are allowed to take finitely 
many values lying in what can essentially be regarded as a quadrature amplitude modulation 
(QAM) constellation. Define n to be 

n := max max{[log \h ijR \\, [log |%r|J}. (3) 
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The channel inputs (transmit signals) in the discrete network are complex valued, with both real 
and imaginary parts taking values from 2 n equally spaced discrete points. The transmit symbol 
is 

x = —(xn + ixj), (4) 

where 

n 

x R = ^2- fc x fl (A;), (5) 

fe=i 

Xl = ^2-^), (6) 

fc=i 

with each x R (i) and Xi(j) in F 2 . The symbol x can be equivalently represented by the 2n-bit 
binary tuple 

(x R ,Xj) = (x R (l),x R (2),...,x R (n),x I (l),x I (2),...,x I (n)). (7) 

Note that the above channel inputs satisfy a unit energy constraint at each discrete-time, and are 
therefore valid inputs even for the Gaussian network with a unit power constraint. This property 
will be helpful in proving the approximate optimality of the linear network code. 

The channel gains are unchanged from the Gaussian model. As in the Gaussian model, the 
channel between two nodes i and j in the discrete network simply multiplies the input x^ by 
the corresponding channel gain hij. At a receiver, the received signal is defined through the 
composition of the following operations: 

• First add all the incoming signals by the standard summation over C. 

• Then discard the signs of the real and imaginary parts. 

• Further discard the fractional portions of the real and imaginary parts and retain only the 
integer portion. 

• Then quantize the integer portion of the real and imaginary parts by truncating their binary 
expansions to n bits of precision, where n is as defined in ([3]). 

Thus y = y R + iyi is the received signal at a node in the Gaussian model, and we denote 
the binary expansions of the integer parts of \y R \ and \yj\ by J2h=i 2 fc 2/i?(&) and YlT=i 2 fc 2//(&), 
respectively. The received signal in the discrete network is then 

y ■= [y] ■■= \^2 k y R (k)j + 1 (i^^yiik)) . (8) 
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As with the transmit signals, y can be equivalently described by the 2n-bit binary tuple 

Hwii) = (l/fl(l) 5 l/fl(2),...,2/fl(n),l//(l),l/j(2),...,i/j(n)). (9) 

We will use the compact notation [•] to represent the overall quantization operation: 

y'j ■= [Vj] ■= [J2 h H x i + z j\- ( 10 ) 

It is important to note that each received signal in the discrete network can be obtained from 
the corresponding received signal in the Gaussian network by performing elementary quantization 
operations (when their transmitted signals are identical, as we intend to be the case). In fact, 
since the transmit signals in the discrete network are valid transmit signals for the Gaussian 
network, we use the same notation for the transmit signals in both models. 

In a similar way, one also obtains a discrete MEMO network corresponding to a MIMO 
Gaussian relay network. In the MIMO discrete network, each transmit and receive antenna can 
be treated as a virtual node. As before, every transmit and received signal (corresponding to 
every transmit or receive antenna) is quantized to lie in a finite set, and the granularity of the 
quantization will take into account all the channel gains between various antennas in the network. 
The transmit signals lie in a finite set and can be described by a 2umimo-^ tuple. The receive 
signals are quantized and described by a 2riMiMO-bit tuple, where 

riMiMO ■ = max max { [log | hf jR | J , [log | hf^ | J } . (11) 

(«j)e£ k=i,...,Ti, J J 

1=1,.. .,Uj 

III. The cut-set bound on the capacity of relay networks 
The cut- set bound 0, 112011 on the capacity C of a relay network is 

C < max min I{x n ;y n c\x n c), (12) 

p(x ,xi,...,XM-l) fieA 

where A is the set of partitions of {0, 1, ... , M} with the source G O and the destination 
M G f2 c , and {xj} and {yj} denote the transmit and receive signals. Additionally, in the Gaussian 
model, the inputs must satisfy an average power constraint, i?[|xi| 2 ] < l,Vi. Let CSg be the 
cut-set bound of the Gaussian network and let CSd be the cut-set bound of the corresponding 
discrete network. In this section, we start the program of approximating Gaussian networks with 
the discrete model by proving that the cut-set bounds of the two networks are within a bounded 
gap. The importance of this result will become clear in the later sections when we prove that the 
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cut-set bound is achievable for the layered discrete network up to a constant gap with a simple 
linear coding scheme, and the same scheme can be used on the layered Gaussian network by 
merely quantizing the signals. 

Lemma 3.1: The cut-set bounds of the Gaussian network and the corresponding discrete 
network are within a constant gap of 0(M log M) bits, 

\CS G -CS D \ = 0(M log M), (13) 

with the gap independent of channel gains or SNR. 



Proof: The lemma is proved in two steps: CSq > CSo in Section III-l and CSq — CSd 



0(M log M) in Section III-2 The procedure to prove both inequalities are the same. We consider 
a particular cut Vt in the network and choose an input distribution for this cut in one of the 
models. Then, in a series of steps, we transform the channel inputs and outputs of the cut to 
the corresponding channel inputs and outputs in the other model. We then bound the loss in the 
mutual information in this transformation. Repeating this procedure across all the cuts in the 
network completes the proof. 

1) CSg > CSd-' The transmit signals in the discrete network are a strict subset of the valid 
inputs for the Gaussian network, and the received signals in the discrete network are obtained by 
quantizing the corresponding received signals in the Gaussian network. Hence, as noted above, 
any operation in the discrete network can be simulated on the Gaussian network. 

Choose any input distribution for the transmit signals in the discrete relay network. Retain 
the same distribution for the inputs in the Gaussian network. Since the received signals in the 
discrete network are obtained by quantizing the received signals in the Gaussian network, for 
any cut f2, by the data processing lemma [|20l . 

I{xa;yci°\xci°) > I{xci]y'nc\xac). (14) 

This proves that CS G > CS D . 

2) CSg — CSd = O(MlogM): We show that the mutual information across a cut in the 
discrete network is at least as high as the mutual information across the same cut in the Gaussian 
network, up to 0(M log M) bits. 
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Step 1: Cut-by-cut analysis 

The cut-set bound for the Gaussian network is 

CS G = max min I{x n ;y n c\x n c) (15) 

p(x ,x 1 ,...,x M _ 1 ) f2eA 

< min max I(xn;yac\xnc). (16) 

OeA p(xq,xi,...,xm-i) 

We consider a particular cut VL/Vt c in the network. The received signal at the j-th node is given 
by 

Vj = ^ h ij X i + Z j- ( 17 ) 

ieN(j) 
The mutual information across the cut, X\ := I{xQ;yac\xQc), is maximized by the choice of 
jointly Gaussian inputs. 
Step 2: Positive fractional inputs 

Instead of the optimal joint Gaussian distribution for the inputs that maximizes X l5 we choose 
a different input distribution. Consider an input symbol x = -js(xr + ixi), where xr and xj 
are independent and uniformly distributed on (0, 1), i.e., xr and xj are positive fractions. Since 
_E[|x| 2 ] = 1/3, it satisfies the average power constraint for the Gaussian network's channel inputs. 
Each input in the network is chosen independently and with the same distribution as x, and is 
denoted by {x\ }. The received signals are denoted by {y, }. In Lemma 



A.l 



in Section 



A 



it 



is shown that the loss in the mutual information for this choice of inputs is O(M). The mutual 

<o\ (o) (2) 

information of this channel is X 2 := I(x n ; y^ c Pa c ) an< ^ compares to the channel in Step 1 as 

X x -1 2 = 0{M). (18) 

Step 3: Quantization of the received signal 
Next we quantize the received signal as follows: 

• Retain only the integer portions of the real and imaginary parts, and discard the fractional 
portions, which we denote by dS . The real and imaginary parts of d- have the same sign 
as the signal. 

(3) (3) 

• Discard the signs of the real and imaginary parts, which we denote by s-^ and s-j , 
respectively. 
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We denote the quantized received signal by y- ' . The mutual information across the channel in 
Step 2 can be rewritten as 

T ft (2) (3) ,(3) (3) I (2k nQ s 

x 2 — l [x n ,y nc ,a nc ,s nc \x nc ) {iy) 

^ T l (2) (3), (2k r / (2) ,(3), (2) (3k . uf (3k , om 

^ w (3) (3), (2k w (2) ,(3), (2) (3k . i nC i , 01 s 

Now the fractional part dk is given by adding the fractional part of the signal with the fractional 
part of the noise. 

ft (2).j(3)i (2) (3k _ , / ,(3)i (2) (3)\_t/j(3)| (2) (3) (2k n ~, 

< £/i(df)-/i(*< 8) ), (23) 

J6f2 c 

where 5- ' is the fractional part of the Gaussian noise. Since dh is fractional with E[\d- \ 2 } < 2, 
its differential entropy is upper bounded by that of a Gaussian distribution with variance 2, and 
so h(df) < log(27re). Since z, is distributed as CAf (0,1), h(zf ] ) = 0(1). Hence, 

/(^jdglsSUg) < |fi c |(log(27re)-0(l)) = 0(M). (24) 

So, defining X 3 := I(x^ ,y^ c \xq!), we have 

X 2 -X 3 = 0(M). (25) 

Step 4: Further quantization of the received signal 

Next we further quantize the received signal at the end of the previous step by restricting the 

binary expansions of its real and imaginary parts to n bits, and denote the result by ]h ', where 

n : = max max{ [log \h ijR \\ , [log |/i„ 7 |J }. (26) 

0,i)e£ 

Note that we have previously quantized the real and imaginary parts to be positive integers. 

Denote the discarded part of the received signal by dj . The mutual information of the channel 

in Step 3 can be rewritten as 

X 3 = I(x%W£,<®\x { £) (27) 

= I\ x n 5 Vn c \ x n c ) + -* \ x n !%l%)fe) (28) 

— *\ x n ^yQ c \ x n c ) Jt ~^\ x Q >%2 c Fn c )- (2") 
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We bound I(x n ;d n l\x n c) next. From the definition of n, \hij\ = \/{hf jR + hfjj) < 2 n+1 . 
Since \x\ \ < 1, | ^2 ieN tj)hijx\ I — M2 n+l . Hence, the binary expansion of the integer 
part of \^2 ieN (j)hijXl | has (n + O(logM)) bits. Since dS is the portion of the received 
signal that exceeds n bits of representation, it is easy to see that at most O(logM) higher 
order bits in the binary representation of Y^i^N(j)^ij x i influence dj . Therefore, we have 

J(:4 2) ;4 4) l45) = O(logM) and subsequently /(4 2) ;4 4 cl4 2 c) = 0(M log M). Now define 
X 4 := I(x ( ^;y^}\x^}) and we get 

X3-X4 = 0(M log M). (30) 

Step 5: Quantization of the transmit signals 

Next, we restrict the real and imaginary parts of the scaled inputs also to n bits. Let the binary 

expansion of y/2x\ R be O.Xi R (l)xi R (2) . . ., so 



'•;;; = -^=J2 2 ~ Px ^^- < 31 ) 



(4) 

° lR V2 



p=i 

(4) 



Similarly denote the binary expansion of y/2x\j. Define 

1 n 
x' iR := -j=^2x iR (p)2- p , 

* p=i 
1 n 

We will consider the new inputs x i := x iR + ix u and let the corresponding received signals 
under these inputs be denoted by y^ The mutual information here compares against that in Step 4 

as 

X 4 = I(xq ,x'ci',ync\xnc,%'(ic), (since x\ is only a function of x\ ') (32) 

, j, (2) ' (4) ' I (2) ' n. n ~, 

11 (2) ' ' I (2) ' \ . j, (2) ' (4) I (2) ' ' s ,o,s 

— i{x n ,x n ,y nc \x nc ,x nc )-\-i{x n ,x n ,y n c\x nc ,x n c,yQ C ) (d**) 

= 1 { x n'i fcl%) + -* \ x n ! x nj 1/q c Fn c 5 x n c 5l/Q c ) (35) 

< H^y'ncl^ + Hiy^ly'^). (36) 
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In ( |35] ), the first mutual information term is obtained due the Markov chain {x\ '} — > {x' { } — > 
{y'j}- Since 

< (2 n+1 )(2-") < 2, (38) 

we get 

| Y, h^xP-^l = O(M). (39) 

idN(j) 

Hence the binary representation of the integer part of | z2 ieN (j\ hij{x\ — xj| will have 0(log M) 
bits. This results in H(\yf - y'j\) = O(logM), due to which H(y$ \y' Qc ) = O(MlogM) 
bits. We obtain 

1 4 -I(x n ;y' nc \x' nc ) = O(MlogM). (40) 

Now observe that x i and y^ are the transmit and receive signals in the discrete network. Also, 
{Xj} are i.i.d. and uniformly distributed on their alphabet. In Step 1, we started with the optimal 
jointly Gaussian inputs for the cut in the Gaussian network and concluded in this step with 
i.i.d. uniform inputs for the discrete network. The total loss in the mutual information in this 
transformation is O(MlogM). Now define CSd as 

CS D := mm I(x n ; y' n c\x' n c). (41) 



CSd evaluates the right hand side of the cut-set bound ( |T2| ) for the discrete network for a specific 
choice of the input distribution. As a consequence of the above arguments, we have also proved 
that CS G -CS D = 0(M log M). ■ 



The choice of i.i.d. uniform inputs for the discrete network will be useful in Section IV-C 
IV. Linear network code for discrete relay networks 



In the Section II-B we have presented a quantizaton-based digital interface for operating a 
Gaussian relay network, which we have called the discrete network. It has the properties that it 
is obtained by merely quantizing the signals at each node in the Gaussian relay network, and that 
its cut-set bound on the capacity is nearly that of the original Gaussian relay network. Therefore 
any coding strategy for the discrete network that achieves rates close to the cut-set bound can 
be easily converted to a nearly capacity-acheiving strategy for the Gaussian relay network by 
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simply quantizing the signals. This motivates and sets the stage for the problem of determining 
nearly capacity-achieving coding strategies for the discrete network, which is the topic of this 
section. What we determine is that simple linear network coding on the bits of the en bloc finite 
representations of the received signals will suffice for the discrete network. The linear network 
code presented in the sequel achieves all rates within 0(M 2 ) bits of the cut-set bound of the 
layered discrete relay network, with the constant gap independent of channel gains and SNR, 
and only a function of the number of nodes in the network. 

A. Coding scheme for relay networks 

Any coding scheme for a relay network requires specification of the source's codewords, the 
functions applied each time to the received signals by the relay nodes, and the destination's 
decoding function. 

A (2^, N) code for a relay network is an encoding function for the source 

x : {1,2,...,2 NR }^X N , 

where X is the input alphabet of the channel, and a set of encoding functions for relay node k, 

9k, m : y m ~ l -> X, for m = 1, 2, . . . , N, k = 1, 2, . . . , M - 1, 

where y is the alphabet of node fc's received signal. For simplicity of exposition, we assume that 
the input alphabets of all the relay nodes are the same, and that so are their output alphabets. 
The symbols transmitted by a relay can depend on all the symbols previously received by the 
relay. The decoding function of the destination M is given by 

g M :y N ^{l,2,...,2 NR }- 

Let W be a random variable uniformly distributed on {1,2, . . . , 2^^} that corresponds to the 
message that source wants to communicate. Such a W is mapped to the codeword x (W). 
The average probability of error is given by 

Pe = Pr(g M (y M ) * W). 

The capacity of the relay network is the supremum of all rates R such that for any e > 0, there 
exists a block length N and a coding strategy for which P e < e. 
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Since the discrete network can be simulated on the Gaussian network, any coding scheme for 
the discrete relay network can be directly used on the Gaussian relay network. We formalize 
this notion in the following lemma. 

Lemma 4.1: A coding scheme for the discrete relay network can be lifted to the original 
Gaussian relay network with no change to its structure and no loss in the rate. 

Proof: The proof is based on the observation that the discrete network can be simulated on 



the Gaussian network. As mentioned in Sec. II-B the discrete network is obtained by constraining 



the transmit signals in the Gaussian network to a finite set and quantizing the received signals 
in the Gaussian network to a finite subset. Hence, any coding scheme for the discrete network 
can be directly employed on the Gaussian network by operating the Gaussian network on the 
quantization-based interface defined by the discrete network. This procedure does not require 
any changes to the coding scheme and does not decrease the rate of the coding scheme. This 
result can be extended to codes for MIMO relay networks and multicast networks. 

We have not explicitly defined a coding scheme for a general Gaussian network, but the above 
ideas are applicable to a general Gaussian network too. ■ 

B. Layered networks 

In the coding scheme described above, the encoding functions {gk, m } at relay nodes operate 
on a symbol-by- symbol basis and can vary with time. We restrict our attention to layered discrete 
networks, for which the encoding functions {gk, m } at the relay nodes can be simplified. In a 
layered network [2J, as the name suggests, nodes are divided into groups or layers. The nodes 
of one layer can only transmit to nodes of the subsequent layer. The source node is the sole 
node in the zeroeth layer, and the last layer, layer L, contains only the destination M. All other 
relays are divided among the intermediate layers. The nodes in layer k are denoted by £, k . An 
example with four layers of nodes is shown in Figure[T]with £ = {0}, C% — {1, 2}, £ 2 = {3, 4}, 
£ 3 = {5}, and L = 3. 

In a layered network we can restrict attention to the following simplified block-by-block 
coding scheme, where each block consists of N symbols. Consider a (2^, N) code for the 
relay network, and suppose that the source transmits a codeword of length N, which we call a 
"block." Now let all the relays in C\ buffer their received signals for N time units. Subsequently 
each relay generates iV-length transmit vectors, i.e., blocks, as a function of their received vectors. 
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Fig. 1. Example of a layered network. 

This is possible since only the source transmits to the relays in £ ls and these relays generate 
their transmit symbols causally as a function of their previous receptions. Similarly, each node in 
Ck waits for the nodes in Ck-i to complete their transmissions, buffers the N received signals, 
and then transmits a block of length N. Finally, the destination receives the block of N symbols 
and attempts to decode the source's transmission. There is no loss in the capacity in operating 
on a block-by-block basis. The source can start the transmission of the next codeword once it 
completes the transmission of the first codeword. Also the relays in C\ are full-duplex, they 
transmit and receive at the same time. So the relays can continue receiving the next block of 
symbols while they transmit a block of symbols. The destination will continue to receive blocks 
of symbols one after another. In this way, we achieve a constant rate R of communication in 
the network. 

C. Linear network codes for the discrete relay networks 

The linear network code we propose is constructed on the discrete network. As described in 



Section II-B[ the transmit and receive signals in the discrete network are quantized to lie in a 



finite set of complex numbers consisting of real and complex parts in [0,1], each with only n 
bits of precision. 

Concerning transmissions, the transmit signal Xi = Xm + ixn has real and imaginary parts 
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given by 

x it R = }^2- k x itR (k), (42) 



n 



k=\ 



x hI = ^2- fc x i)J (A;). (43) 



fc=i 
with each x i: n(i) and x i: j(j) in F 2 , and where 

n : = max max{ [log \h ijR \\, [log | hiji | J } . (44) 

The received signals in the discrete network {$/•} are obtained by quantizing the received 
signals in the Gaussian network. The Gaussian received signal is quantized by discarding the 
signs of the real and imaginary parts, further discarding the fractional portions of the real and 
imaginary parts and retaining only the integer portion, and finally quantizing the integer portion 
of the real and imaginary parts by truncating their binary expansions to n bits of precision. If 
Vj = Vj,R + l Vj,i i s tne Gaussian received signal, and we denote the binary expansions of the 
integer parts of \\)^r\ and \yjj\ by YlkLi 2 fc 2/j,i?(^) an d Y^k=i 2 k yj,i(k), respectively, then the 
received signal y ■ in the discrete network is 





V 3 = [Vj] ■= [ l^Vy itR {h) ) +^ | ^2VW ) • (45) 

Hence we regard the received signal in the discrete network as a binary tuple 

(yjA 1 ),vj,r ( 2 )> • • • :%,bW, vj,i( 1 ), yj,i( 2 ), ■ ■ -,yj,i( n )) 

of length 2n where each entry is or 1 . Similarly, the transmitted signal in the discrete network 
can also be so regarded. In the rest of this paper, we reserve the phrase "binary 2n-tuple" or 
"2n-tuple" to describe such a vector, which can be converted in a straightforward fashion to a 
complex symbol for transmission, or can be obtained in a straightforward fashion from a complex 
symbol that is received. 

After N such symbols have been received, there is a 2niV-length binary vector that is the 
received block, and similarly there is a 2niV-length binary vector that is the transmitted block. 
We will reserve the phrase "binary 2niV-vector" or "2niV-vector" to refer to such a vector which 
represents a block of N received symbols, either for block transmission or after block reception. 
We will represent a 2niV-length received or transmitted binary vector at a node j using an 
underbar, as in y . or x -, respectively. 
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Fig. 2. Linear encoding at node j. 



We will employ a linear coding scheme where the transmitted block is simply obtained by 
multiplying the received block by a 2nN x 2nN matrix of Os and Is. 

The overall coding scheme is randomly generated, and is simple to describe. 

1) Source's codewords/ There are 2^^ messages. Using a uniform distribution on binary 2n~ 
tuples, the source randomly generates a set of 2 NR codewords, each of length 2nN, where each 
codeword is constructed by independently picking N binary 2n-tuples from the uniform distri- 
bution. Note that each codeword corresponds to N complex symbols, which are transmitted over 
iV discrete time instants. The choices of the rate R and the block-length iV are elaborated later 



in the proof of Lemma 4.2 in Section IV-D The source transmits the codeword corresponding 
to the particular message that has been chosen for communication. 

2) Relay's linear mappings/ Relay j randomly chooses a 2nN x 2nN binary matrix A,-, 
by independently picking each entry as either or 1 with equal probability. This will be the 
matrix representing the linear code at node j. The relay buffers iV received binary 2n-tuples, 
and adjoins them to construct a binary 2nN- vector that constitutes the received block y .. It then 
multiplies this binary 2niV-vector by Aj to obtain a binary 2niV-vector x ■ that constitutes the 
transmit block. It splits this binary 2nN- vector into N binary 2n-tuples. Converting each of the 
binary 2n-tuples back into complex numbers, with the real and imaginary parts, each in [0, 1], 
and each of n-bit precision, gives iV complex symbols. These N complex transmit symbols 
are transmitted by the relay node over N discrete-time instants. An example of the encoding 
operation at node j is shown in Figure [2} 

All the M — 1 relays independently construct their binary encoding matrices in a similar 
manner as described above. 

Next, to define decoding, we define strong typicality of vectors: 

Definition 1: A vector x E X N is defined to be e-strongly typical with respect to a distribution 
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p(x), denoted by x G % >p , if 

\v x (x) — p(x)\ < €p(x), WxEX, (46) 

where e G 1Z + and u x (x) = h\n : x n = x\ is the empirical frequency. 

This is extended in the standard way to include joint strong typicality of vectors (see Il20l0 . 
Vectors x = {x\, x 2 , . . . , x N ) G X N and y — (y x , y 2 , . . . , yjv) G 3^ are e-jointly strongly typical 

with respect to the distribution p(x,y), if z — (zi,z 2 , ■ ■ ■ ,z N ), where each Z{ = ^ s 

V Vi J 
jointly typical with respect to the distribution p(x, y). We employ the following definitions from 

Il23ll , slightly modifying them to include the linear operations. 

Define a singleton set Xo(w) := {x (w)} for every message w. 

Now consider a node j in C\, the first layer. Define the set of received vectors at node j 
associated with a message w as 

y'jM ■= {^:(^XoH)eU (4V) 

with p denoting the distribution p(xo,Vj) where xq is uniformly distributed and p(yj\xo) models 
the channel from node to node j in the discrete network. 

Continuing to consider the node j G C\, we define the set of transmit vectors at node j that 
are associated with the message w as 

Xj(w) = {xj : Xj = Ajiy.), where y. G y'^w)}. (48) 

Next consider a node j G Ck with k > 2. Noting that any i G M{j) belongs to layer £fe_i, 
we recursively (in k) define its set of received vectors at node j associated with message w as 

y'j(w) := {y. : (y., {Xijiztfij)) G Tej,, for some x { G Xi(w), for each i e JV(j)}. (49) 

The distribution p above is p({xi}ieAfj,y'j), where {5i}j e jv"(j) are independent and uniformly 
distributed, and p(y'j\{xi} ie x(j)) models the channel from the nodes in Af(j) to j in the discrete 
network. We also define the set of transmitted vectors at node j associated with the message w 
as 

Xj(w) := {xj : Xj = Ajiy.), for some y. G y'^w)}. (50) 

Definition 2: We write (y .,w) G T e if y . G yAw). 
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3 ) Decoding at the destination: The destination receives the 2niV-vector y , and decodes 
by searching for a message w such that (y ,w) E T e . If it finds more than one such message 
or if it finds none, it declares an error. Else it declares the unique choice as its estimate of the 
source's message. 

It should be noted that though the relay's encoding is a linear operation over the block of 
2niV-vectors, the end-to-end channel from the binary 2niV-vectors transmitted by the source to 
the block of binary 2niV-vectors received by the destination is not linear over the binary field, 
and we cannot describe decoding as just inverting a binary matrix 2nN x 2nN matrix. This is 
because of the truncation operations that are an integral part of the very definition of the discrete 
network. 

D. Computing the rate achieved by the linear network code 



Next we compute the rate achievable by the linear network code. In ( |4Tj ), we defined CSp as 
the cut-set bound's value for the discrete network for a specific choice of the input distribution. 

Lemma 4.2: The linear network code achieves all rates within a gap of 0(M 2 ) bits of CSd 
for layered networks, where the 0(M 2 ) gap is independent of channel gains or SNR. 

Proof: 



Probability of error: The probability of error for the coding scheme (see Section IV-A for the 



notation) is given by 

P e = PT(g M (y M ) ± W) (51) 

2 iV-B 

= ^rJ2 P <9m(i m ) + w\W = w). (52) 

w=l 

As is standard, since P e is symmetric in the transmitted message, we assume wlog that W = 1 
and evaluate the probability of error when the first codeword is transmitted, P e |w=i. 
Error events : The possible error events at the destination M are 

• E Q : One of the transmitted or received vectors in the network is not strongly typical. That 
is y . ^ T e ,p, where p denotes the distribution induced on y, by the uniform distribution on 
each Xi for i E A/"(j), and by the conditional probability p(yj\{xi}ieAf{j)) describing the 
channel in the discrete network from the nodes in J\f(j) to y-. 
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Fig. 3. Example of a cut O in the network in Figure [T] Here Q — {0, 1, 3} and Q c = {2, 4, 5}. 



• E w : (y M ,w) E T e , where w ^ I. 
By applying the union bound, 

Pe\w=i < Pt(E ) + Pt(E 1 AE c ) + Pt(\/ E W AE C AE1) 
From Lemma |B.1| (see Section [B]), for any e > 0, 



Pt(Eq) + Pr(Ei) < e, for A^ sufficiently large. 



Therefore, 



e W: 



2 NR 

-i < e + yPr(E v 



w=2 



(53) 



(54) 



(55) 



where E w := E w A E c A E\. 

Error event E w : Let £fe be the set of nodes in layer k. We say that a node j G £fc is confused\ l \ 
by u> if (y'.,w) G T e for some w^l. The destination is confused by w under £^ w . The source 
is not confused by definition. Hence, under the error event E w , the nodes in the network get 
separated into two sets, ones that are confused by w, and others that are not confused by w. 
This is a cut in the network; see Figure [3] for an example. 
Fix an arbitrary cut fi, and define the sets (see Figure |4]) 



F k n ■= C k nn and G k n := C k n ^ c . 



(56) 



'This is similar to the notion of distinguishability used in |5] and (2). 
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Fig. 4. Layers (k — 1) and k in a network. The wireless links connecting the nodes are not shown. Each oval represents a 
particular set. The dashed curve in the center divides the network into Q and fl c . 



Let us denote the concatenation of the received 2niV-vectors of nodes at layer £ by y . 
Or, in greater detail, if ii,i 2 , ■ ■ ■ ,i q are the nodes, in lexicographic order, that are present in 
layer £, then y := (y. ,y. ,...,y. ). Correspondingly, we also define the concatenation of 
transmitted vectors at layer £ under y by x c (y ) := (A h (y. ),A i2 (y'. ),..., AAy. )). Thereby 



^c, 



^c. 



—n' 



we define the concatenated received vectors y r ,y r ,---,y r at the L layers in the network, 
and also the corresponding concatenated transmitted vectors. Next we define the network-wide 



received vector y 



{y'r 



v, 



y ), with receptions ordered according to layers, and 



lexicographically within layers. The corresponding network-wide transmitted vector is x v (y ) : = 
(x Cl {y Ci ), x C2 {y C2 ),..., x Cl (y CL ) ) . 

We now wish to define the subset of network-wide received vectors that are consistent in the 
sense of joint typicality with a source message 1. Similar to the definition of yAw) in ( [47] ) and 



( |49| ), we define received vectors at a node j that are associated with message w, except that 
we now do it at a layer and across the whole network. We can do this by exploiting the layered 
nature of the network where nodes at one layer only transmit to the nodes in the next layer. 
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We say a network-wide received vector y = (y ,y ,. . . ,y ) is associated with message 
w if for each £ and each node j G Hi, (y ., x Cl _ 1 {y c )) G % tP , where p is the distribution 
p(xc e _ 1 ,yj), where Xc t _ t is uniformly distributed, and piVjlxct^) is the conditional distribution 
describing the channel from transmissions by nodes at layer £ — 1 to node j. We denote by 
yy{w) the set of such network- wide received vectors associated with message w, and denote 
by Xv{w) the set of network- wide transmitted vectors. If the source transmits message w, then 
with high probability, for a sufficiently large N, by the asymptotic equipartition property (AEP) 
of strongly typical vectors ll20l . the random network- wide vectors y will lie in yv(w). 

We note that the codewords corresponding to the messages are random, as are the matrices 
{A,}, due to the random coding strategy. As a consequence, 3^v are a l so random, since they 
are dependent on the codewords and network coding matrices. Now we proceed to conduct an 
analysis under the random coding strategy. 

Randomly choose a codeword for each message, and a set matrices {Aj : 1 < j < M — 1}, 
and randomly, with a uniform distribution, pick a network-wide received vector y from the 
set 3^v(l)- F° r the random coding strategy, define Q™ n as the event that every node in Gk,n is 
confused by w, i.e., for a node j G G^q, 

(feHW^!,^) e %#, for some x^w) G Xi(w), for all i G C k -i, (57) 

with p denoting the distribution p({£»}ie/vx?))i/i)> where {xi^isNij) are independent and uni- 
formly distributed and p(y,-|{#i}ie/v"C?)) models the channel in the discrete network. Define .7-™ n 



as the event that no node in F kn is confused by w. Note that, in ( [57] ), due to the random coding 
strategy, Xi( w ) is also random. 

For a random choice of y , let Vi, Wt n denote the probability that no node in fi is confused 
by w when 1 is the transmitted message, and all the nodes in tt c are confused by w. Then, 



*W = Pr(A^nA^n) (58) 

fc>0 

fe-i fc-i 

= II Pr (^ I A ^ A Sfr) Pr(^n l^n A A ^ A Ofa) (59) 

fe>l «=o 1=0 

fe-i 

< H Pr (^iA^ A ^)- ( 6 °) 



fe>l Z=0 
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Now we make two important observations concerning each relay node j's buffering of N 
of its received symbols and multiplication of such a buffered 2niV-vector by the matrix A,-. 
First, since the matrix A,- is independently chosen for each relay, all the relay mappings are 
independent of each other. 

Second, when we randomize over all the linear encodings Aj at relay node j, every transmit 

vector x_j E Xj( w )' f° r an Y w > is independently and uniformly distributed over the set of binary 

vectors of length 2nN. Hence each symbol in a transmit vector x^ is independently and uniformly 

distributed. 

Error event at layer k: Next, define Vi wn as the contribution of the /c-th layer to the probability, 

i.e., 

fc-i 

H»fl ■= Pr (^i A ^ A g w)- (6i) 

1=0 

Consider the transmit vectors of node i E Cu-i given by x { = Aijj.. Under the conditioning 
events to determine V^ wn , we know that x { E Xi( w ) f° r i £ Gk-i,n, and x_ t E Xi(X) f° r 
i E Fk-i,ci- Denote the transmit vector of node i E Gk-i,n by x^w) to indicate that it is the 
transmit block of a confused node, and denote the transmit vector of node i E -Ffc-i,n by Xj(l), 
since it is not confused by any other message. In order to compute V\ w n , we need to compute 
the probability that for every node j E Gk,n, 

(fe(w)}i6/: fc -i'^) G ^,P' for some ^i( w ) G Xi(w), for all i E £ fc _i, (62) 

where pis the distribution p{xc k _ l - l y J ), where {xi}i^c k -i is uniformly distributed, andp(y J |x£ fc _ 1 ) 
is the conditional distribution describing the channel from transmissions by nodes at layer k — 1 
to node j, given that 

• x F _ oW was transmitted by the nodes in Fk-i t n, 



x 



Gk- 



in (w) was transmitted by the nodes in Gk-i,ci, and 



({Xi(w)} ieGk _ in ,y) E T e , p , where p is the distribution p({x i } ieGklsv y' j ). 



-3 



Project xy(u>), the set of network-wide transmit vectors, onto the nodes in £k-i to obtain the 
set xc k -i( w )- pick A-i-wide transmit vectors x Ck _ x = {^(w)} ie/ ; fc _ 1 from xc k -A w )- If node 3 
checks to see if this choice of xc k _ x will result in (x,c fc _ 1 ,y.) lying in T e , p , and x^w) = x^w), 
for all % E Gk-i,a, then, under the conditions listed above, it will find a positive answer with 
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probability less than 

2- NI i x F k _ 1 ^y' J ,x Gk _ lsl ) _ 2~ NI{xF k-i,n>y'j\ x G k - U n\ (63) 



where the uniform distribution on {sj} is used to evaluate the mutual information and ( [63] ) is 
due to the independence of transmit symbols {xi}. The choice of the i.i.d. uniform distribution 
for {x^ follows from the discussion following ( [6Q| ). Instead, for the set x c _ , if x^w) ^ x t (w) 



for all i G Gk-x,ci, then the probability of ( |62| ) is 



2-^(^_i, n ^G fe _i,ni^') < 2~ NI( - XFk - 1 . n ' y '^ XGk - 1 ' n) . (64) 

For all other choices of {x i (w)}i G £ fc _ 1 G xCk-i(. w )> when if Xj(iy) 7^ x { (w) for node i in a subset 
of Gfc_i,n, the probability of ( [62] ) is similarly upper bounded by 

2-" /V7 ( a **-i,ni^>G fc _ lin )_ ^ 



From Lemma 



C.l 



(see Section O, Ix/^fV)! = 2°^ M ^ N . So, we apply the union bound with 
respect to all the vectors in X£k-i( w ) to get 

fc-i 
Pr(node j is confused by w\ A J 7 ^ A £™n) 

1=0 

< 2 0(M)N 2- NI ^ XF k-i,n' y '^ XG k-i,n\ (66) 

We bound V\ wVL as 

fe-i 
^i,«,,n = Pr(nodes in G fc , Q are confused by iu| /\ 7£ li0 A 0£ 1>n ) (67) 

2=0 
fc-i 

< J] Pr(node j is confused by w\ /\ Jf_ l>n A ^-1,0) (68) 

jeG fe , n 1=0 

< 2 ( M )l G '=^l Ar 2~ iVEjGGfc . n7(a:Ffe - 1 ' n; ^ |a:Gfc - 1 ' n:) (69) 

< 2°( M )l G fc,«l Ar 2~ 7V7(:rFfe - 1 ' n;?/ ^.« |:!:Gfe - 1 ' n) . (70) 



To obtain the bound in ( [68] ), we note that under the conditioning events, the nodes in £&_ 



transmit x c _ = {x i } ie c k _ 1 - Given a set of transmissions, x c , the received vectors y . and 
y . at node j\ and j 2 , respectively, in layer k are conditionally independent. We obtain the bound 



by noting that the conditioning events in ( [68] ) involve many such mutually exclusive transmissions 
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by the nodes in Ck-i- The bound in (|69]) is from ([66]). With a lexicographic ordering of the nodes 



in Ck as {ji,j2, • • • , 3\G kn \\> me bound in (|70]) is obtained as 

I Gfe.nl 

IGfe.nl 

= 5Z ^(^iil^fc-Lo) - H(y' n \xc k ^) 
1=1 

I Gfc.nl 



■Jm 



Z=l 



(71) 

(72) 
(73) 



where d72b follows from the Markov chain (Lt=ij/. ,££.,) — >■ ar/v, — ► y', , and that condition 

— 3l * -^ 

ing reduces entropy. 



Back substitutions : Substituting from f70| ) in ( |60| ), we get 

V lw n < (2 0{M) ^ k >^ Gk - nlN )(2' N ^ k - lIixFk - 1 - n,y ' Gk ^ XOk - 1 - n) ' 
< 2 0( - M )\ nc \ N 2~ NI{xn ' y '^ Xnc ' ) 
_ 2°( m2 ) n 2~ NI ^ Xn ' y '^ XQC \ 



(74) 
(75) 
(76) 



The bound in (75) follows from the steps below and the chain rule for mutual information. 



I ( x F k _ 1 ^ y' Gk a\ x G k -i, 



H (y'G h J X Gk-i,n) ~ Hiy'^Jxc^) 

ff (4,J x Gn,J - H (y' Gh J x vi ^tly Ge 



fc-i. 



> H (VG k . a l^nc, U^W) - ff (y G . _ |x v , U*Jy a 



fc-1./ 



e,Q' 



> Hxn; yG h J x te, u t=iyG in ) 



(77) 
(78) 
(79) 
(80) 



where the (78) uses the Markov structure of the layered network, 



k4-i.Uf = i4, fl ) -> ^k-! "► ^/G fc .o- 



The probability of the event E w is bounded as 

Pr(^) < J2 Vl >™> 



< \^2 0(M2)Ar 2 _Ar/(:!:n; ^ c|:2:nc) 

< 2 M+1 2°( A/2 ) Ar 2~ iVmin «eA 7 "( x n'S'n':l a: ^) 



(81) 
(82) 
(83) 
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Finally, substituting the bound for Pi(E w ) from ( |83| ) in p5\ gives us 

P | w -i < e + (2^ — 1)2 A/+1 2°( M2 ) Ar 2~ Arminf2GA/ ^ Q;y n c ' ;Enc ^ (84) 

< e _i_ 2 A/+1 2 Ar (- R + °( M2 )~ minneA/ ( :rn;y n c l Xnc )) (85) 

Hence, if R < mm neA I(xn; y' nc \xQc) — 0(M 2 ), then P e |w=i ° r -Pe can be made arbitrarily 
small for a sufficiently large N . ■ 

£. Approximate optimality of the linear network code for layered discrete networks 

Next we prove that the linear network code is approximately optimal for the layered network. 

Theorem 4.3: The linear network coding scheme achieves the capacity of the layered discrete 
network up to a bounded number of bits, i.e., the rate R achieved by the coding scheme is 
bounded from the capacity Cd of the layered discrete relay network by 0(M 2 ) bits, 

C D -R = 0(M 2 ) } (86) 

where the constant gap is independent of channel gains or SNR. 



Proof: In Lemma |3.1[ we proved that the cut-set bound of the Gaussian network and its 

we 



discrete counterpart are within a bounded gap of 0(M log M) bits. In fact, in Section III-2 



also showed that CSd = CSg — 0(M log M), where CSd is the value of the cut-set bound of 



the discrete network for the specific choice of the uniform i.i.d. input distribution (see (41 )) and 



CSg is the cut-set bound of the Gaussian network. In Section III-l we proved that CSg > CSd, 
where CSd is the cut-set bound of the discrete network. Therefore, CSd = CSd — 0(M log M). 
Since CS D > CS D , we have 

\CS D -CS D \ = O(MlogM). (87) 



From Lemma 4.2 we know that the linear network code achieves a rate R within 0(M ) bits 



of CSd- Hence, along with the bound in ( [87] ), the linear network code achieves rates within 
0(M 2 ) bits of the cut-set bound CS D - Since CS D > C D , the theorem is proved. ■ 

V. Linear network code for layered Gaussian networks 

In the earlier sections we proved that the cut-set bounds of the Gaussian and the discrete relay 
networks are within a bounded gap of 0(M log M) bits. Later we developed a simple linear 
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coding scheme for the layered discrete network that is approximately optimal. Also, the discrete 
model is a digital interface for operating the Gaussian networks; the signals in the discrete model 
are obtained by quantizing the signals in the Gaussian model. Combining these results we obtain 
a nearly capacity-achieving coding strategy for layered Gaussian relay networks, which consists 
of (i) quantizing received signals, and (ii) collecting a block of such signals and applying linear 
network coding on the overall vector of bits. 

In this section, we prove the optimality of the linear network code for Gaussian relay networks 
and later extend this to MIMO Gaussian relay networks, where the nodes can have multiple 
transmit and receive antennas, and to multicast networks, where the source can transmit the 
same information to a subset of the nodes. 

A. Approximate optimality of the linear code for layered Gaussian relay networks 

Theorem 5.1: The linear network coding scheme achieves the capacity of the layered Gaussian 
relay network up to a bounded number of bits, i.e., the rate R achieved by the linear network 
code is bounded from the capacity Cq of the layered Gaussian relay network by 0(M 2 ) bits, 

C G -R = 0(M 2 ), (88) 

where the constant gap is independent of channel gains or SNR. 

Proof: The linear network code is constructed over the layered discrete relay network 



as described in Section IV and can be used on the Gaussian relay network as mentioned in 
Lemma 14.11 

From Theorem 4.3, we know that the linear network code achieves all rates within 0(M 2 ) 



bits of the cut-set bound of the layered discrete relay network CSjj. In Lemma 3.1 , we proved 
that \CS G - CS D \ = 0(M log M). Combining these results, and noting that CS G > C G , we 
get the statement of the theorem. ■ 

The theorem establishes that the discrete model can serve as a digital interface for Gaussian 
networks, since a coding scheme for the layered discrete relay network involving simple linear 
operations is approximately optimal and can be used on the layered Gaussian relay network. 
Since the linear network code achieves rates within a bounded bit gap of CS G , the theorem is 
also a proof of the near-optimality of the cut-set bound for layered Gaussian relay networks, 
though this was proved earlier in for a smaller bounded gap of 0(M log M) bits. 
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As a consequence of the above theorem, we can also prove the following lemma. 

Lemma 5.2: The capacity of the layered discrete relay network and the capacity of the layered 
Gaussian relay network are within a bounded gap of 0(M 2 ) bits, i.e., \Cq — Cd\ = 0(M 2 ). 
Furthermore, a near-optimal code for the layered discrete relay network is also a near-optimal 
code for the layered Gaussian network. 

Proof: The linear network code is near-optimal for both the layered discrete relay network 
and the original layered Gaussian relay network and achieves rates within 0(M 2 ) bits of the 



capacities of both the networks (see Theorems 4.3 and 5.1 ). Hence, it follows that the capacity 
of the layered Gaussian and the layered discrete network are within a bounded gap of 0(M 2 ) 
bits, i.e., \C G -C D \ =0(M 2 ). 

Next consider a near-optimal code for the layered discrete network that achieves rates within 



a SNR-independent gap from the capacity of the discrete network. From Lemma 4.1, we can 
lift this coding strategy to obtain a code for the original Gaussian relay network without any 
decrease in the rate. Since the capacity of the two networks are within a bounded gap, the lifted 
code is a near-optimal code for the layered Gaussian relay network. ■ 

B. MIMO relay networks 

We can extend the linear network coding scheme to layered MIMO networks, where nodes 
have multiple transmit and receive antennas. In (|2]), we defined the received signal in a MIMO 
receiver in a Gaussian relay network. We operate the MIMO Gaussian relay network on the 
discrete interface as described in Section III-BI The linear network code is defined on the MIMO 
discrete relay network. The basic ideas in the coding scheme remain the same, but with some 
modifications to accommodate multiple antennas. The details are: 

1) Source's codewords: The source constructs a set of 2^^ codewords of length N. Every 
codeword is a T x N matrix, where each row of the matrix is transmitted by one of the transmit 
antennas of the source. Each entry in the matrix is a complex number from the QAM constellation 



(or equivalently, a 2n M ihio -length binary tuple, see Section |II-B[ ) that is independently chosen 
with the uniform distribution. 

2) Relay's mappings: Relay j has Uj receive antennas, receives a vector of Uj symbols every 
instant, and buffers N such vectors. The received binary tuples are adjoined to construct a 
2riMiMoNUj-\ength binary vector. The relay constructs a 2umimoNTj x 2iimimoNUj binary 
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matrix A^mimo where each entry of A^mimo is either or 1. It multiplies the 2n MIMO NUj- 
length binary vector with Aj >M imo to obtain a 2riMiMoNTj-length binary vector. It splits this 
vector into Tj vectors of length 2umimoN . Each tuple of length 2umimoN corresponds to TV 
transmit symbols for a particular transmit antenna. 

3) Decoding at the destination: Destination collects N received signals and finds a message 



that is associated with the received vector; see the decoding procedure in Section IV-C3 for 
details. If it finds more than one message that satisfies this condition or if it finds none, it 
declares an error else it declares the unique transmitted message. 

Let the maximum number of transmit or receive antennas in the network be T max . 

Theorem 5.3: The linear network coding scheme achieves the capacity of the layered MIMO 
Gaussian relay network up to a bounded number of bits, i.e., the linear code achieves all rates 
R bounded from the capacity Cg,mimo of the layered Gaussian relay network as 

C g ,mimo-R = 0(M 2 T max ). (89) 

The constant in the bounded gap is independent of channel gains or SNR. 

Proof: The steps in the proof are essentially the same as that of Theorem |5.1| 

1) Operate the MIMO Gaussian relay network on the digital interface defined by the discrete 
model. 

2) First we prove that the cut-set bound on the capacity of the MIMO Gaussian relay network 
and the MIMO discrete network are within a bounded gap of 0(MT m3iX log MT max ) bits. 



Here, the proof of Lemma 4.2 can be reused by a simple observation. While evaluating 
the cut-set bound, we can view each transmit and receive antenna as a virtual node. Hence 
the total number of nodes in the network is at most MT m3iX which gives us the necessary 
bound. 
3) The coding scheme achieves all rates within 0(M 2 T max ) bits of the cut-set bound of the 
discrete MIMO network evaluated for a specific choice of the input distribution. Here the 
input distribution is i.i.d. across all the transmit antennas in the network and is uniform over 



the alphabet. The arguments in the proof of Lemma 4.2 carry over to MIMO networks by 
noting that due to multiple transmit and receive antennas, the receive and transmit signals 
are complex vectors instead of scalars. In the proof, the value of |%i(iu)| for MIMO 
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networks will be different than before. For MIMO networks, the bound on H(y \x Cl ) m 
(fTT6]) increases to T max |£i| O(N). Due to this, the bound on |*(w)| is 2°( MT ™^ N . With 



this, we get the required bounded gap of 0(M 2 T max ) from the cut-set bound. 
4) Then it is straightforward to prove near-optimality of the linear code for MIMO Gaussian 
networks; see Theorem |5.1[ 



We state the counterpart of Lemma 5.2 for MIMO relay networks. 

Lemma 5.4: The capacity of the layered MIMO discrete relay network and the capacity of 
the layered MIMO Gaussian relay network is within a bounded gap of 0(M 2 T max ) bits, i.e., 
\Cg,mimo~Ci),mimo\ — 0(M 2 T max ). Furthermore, a near-optimal code for the layered MIMO 
discrete relay network is also a near-optimal code for the layered MIMO Gaussian network. 



Proof: Same as the proof of Lemma |5.2| ■ 

C. Multicast networks 

In a multicast network, the source node wants to communicate the same information to a 
subset of the nodes (instead of a single destination as in the previous sections). The remaining 
nodes which are not the intended recipients act as relays. Let V be the set of nodes that are the 
intended recipients of the source's message. Then the cut-set bound on the capacity Cq,mvU for 
such networks is given by 

C G ,Muit < max min min I(xn;yn°\xcic), (90) 

p(x ,x 1 ,...,x M _ 1 ) D£T> U 6 A D 

where A D is the set of all cuts in the network that separate the source from the destination D. 



We can extend the linear network code from Section |IV-C| to this class of networks, with the 
only difference being that all the intended destinations in V decode the source's transmission. 
In a layered multicast network, the destinations in V can be spread across the various layers in 
the network. 

Theorem 5.5: The linear network coding scheme achieves the capacity of the multicast layered 
Gaussian network up to a bounded number of bits, i.e., the linear network code achieves any 
rate R which is bounded from the capacity Cg,m u u of the multicast layered Gaussian network 
as 

C G ,Muit-R = 0(M 2 ). (91) 
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The constant in the bounded gap is independent of channel gains or SNR. 



Proof: The proof of this theorem resembles that of Theorem 5.1 We give a general outline 
of the proof and skip the details. 

1) Operate the multicast Gaussian network on the digital interface defined by the discrete 
model. 

2) First we prove that the cut- set bounds on the capacity of the multicast Gaussian network 
and the multicast discrete network are within a bounded gap of 0(M logM) bits. Though 
the cut-set bound for multicast networks is slightly different from that of relay networks, 



the proof of Lemma 4.2 can be reused by individually comparing the cut-set bound between 
the source and each destination in V. 
3) Then we prove that the coding scheme achieves all rates within 0(M 2 ) bits of the cut-set 
bound of the discrete network, when the cut-set is evaluated for i.i.d. uniform inputs. We 



re-use the proof of Lemma 4.2 with the only difference being that multiple destinations 



want to decode the source's transmission instead of a single destination. 



4) With the above arguments, we can prove the equivalent of Theorem 4.3 for layered 
multicast networks. Then, with the same arguments as in the proof of Theorem 5.1[ we 
can prove the near-optimality of the linear code up to 0(M 2 ) bits for the original layered 
multicast Gaussian network. 



We state the counterpart of Lemma 5.2 for multicast relay networks. 

Lemma 5.6: The capacity of the layered multicast discrete relay network and the capacity 
of the layered multicast Gaussian relay network is within a bounded gap of 0(M 2 ) bits, i.e., 
\Cg,MuU ~ Cd,Muu\ = 0(M 2 ). Furthermore, a near-optimal code for the layered multicast 
discrete relay network is also a near-optimal code for the layered multicast Gaussian network. 



Proof: Same as the proof of Lemma |5.2| ■ 

VI. Concluding remarks 

Our overall near capacity achieving strategy for a layered Gaussian relay network can be 
summarized as follows. The number n of bits of precision is chosen as the logarithm of the 
largest real or imaginary part of any channel gain. Then by simple quantization and truncation, 
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we create a purely discrete but stochastic network. This layered discrete network is operated in an 
en bloc fashion. At each node received signals are buffered for a block, based on which, transmit 
signals in the next block are generated. The coding strategy is particularly simple. Linear network 
coding is performed by simply multiplying the buffered vector by a square random binary matrix. 
The resulting long vector is broken into symbols for use in the next block. A similar strategy 
can be used for MIMO nodes, and also for multicast relay networks. 

The above strategy has the advantage that it employs a simple coding strategy requiring 
minimal signal processing at the relays. We note that most codes for relay networks do require 
considerable signal processing by the relay nodes consisting of non-linear operations such as 
vector quantization, compression, decoding, etc. In the linear coding strategy presented in this 
paper, each relay performs scalar quantization followed by a simple matrix multiplication. Non- 
linearity is thus introduced into the code due to quantization. Introducing some non-linearity is 
unavoidable in the strategy, indeed necessary, due to the inability of linear codes to achieve the 
capacity within a constant gap in the Gaussian relay network. 

The linear network code is a robust scheme in the sense that the relay need not know the 
channel gains on either the incoming or the outgoing links. Since the transmit and receive 
signals are quantized to binary tuples of length 2n, all the nodes only need to know the global 
parameter n. The quantization requirements of the linear network code are completely defined by 
the parameter n, which therefore also determines the resolution of the analog-to-digital converter 
(ADC) and digital-to-analog converter (DAC) that are required for operating the network within 
a bounded gap from the network capacity. 

The random matrix at the relays interleaves all the bits and perhaps increases the complexity 
of the decoding algorithm at the destination. It is an interesting problem to simplify this by 
constructing explicit encoding matrices for the relays that preserve the properties of a random 
matrix. This might help us construct a graphical model for describing the channel from the 
source to the destination and lead to a low-complexity code for relay networks that is decodable 
with iterative message passing algorithms. 

For a general Gaussian network with many sources, many destinations, relays, and any data 
transmission requirements, one can similarly construct the corresponding discrete counterpart 
and prove that the capacity region of the corresponding discrete network is contained in the 
capacity region of the Gaussian network. The converse remains to be proved however, if true: 
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that the capacity region of the original Gaussian network is contained in that of the discrete 
network, up to a constant gap. 

Since the network coding scheme proposed here for the wireless relay network case is the 
same as the network coding scheme that achieves the capacity of the wireline and the linear 
deterministic network, it is possible that the body of results developed in network coding could 
be possibly used to solve other problems in wireless networks too. 

Appendix A 

LOSS OF CAPACITY FOR UNIFORM INPUTS 
Lemma A.l: Consider a cut VL in a Gaussian network. Then the loss in the mutual information 



for choice of fractional inputs (as defined in Step 2 of Section III-2) is O(M). 

Proof: Denote the i.i.d. Gaussian inputs by {xf}. Also denote i.i.d Gaussian inputs dis- 



tributed as £/V(0, 1/3) by {xf}. Now consider the fractional inputs defined in Section III-2 
denoted by {xf}. 

We observe some similarities between {xf } and {xf}. Both the sets of inputs are i.i.d. and 
each input has the same variance in both cases. Hence, the covariance matrix of x G = (xf) 
or x F = (xf) is the scaled identity matrix Ijwxm/3. Denote the received signal in tt c under 
the Gaussian inputs and the fractional inputs by y G c and y^ c , respectively. It follows that the 
covariance matrices of the vectors (x G ,yQ C ) and (x F ,y F c ) are the same. 

Now consider the mutual information Ifaf,; yfi^x^), 

l( x n'iyn c \ x Q c ) = M I !]I%) _ ^rnr^ito) ("2) 

= ^(aft-M^la&.yJ.) (93) 

iefi 

= £(2tog(l/V2))-fc(a£|a&,i&) (94) 

= -\n\log2-h(xfi\x^,y^). (95) 



where ( |93] ) follows from the independence of {xi}, and ( |9"4] ) follows from direct computation 
of the differential entropy. Now, for the circular Gaussian inputs 

h(x^\xg) = J>(s?) = log(7re/3)|n|. (96) 



iefi 
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Since, for a given covariance constraint, the conditional entropy is maximized by the Gaussian 
distribution with the same covariance 11221 . 



fc04|a&,3&) < Hx%\xk,$&). (97) 



Substituting (96) and (97) into (95), 



J(a&J/n»|a&) > ^nl^SO - h(x%\x&,y&) - |Q|(log(7re/3) +log2) (98) 

> I{xg;yg\xg)-0{M). (99) 

Now the mutual information with i.i.d. CA/"(0, 1) inputs {xf } is given by 

J(a&3/&|zg c ) = log|/ + Kn^|, (100) 

where Hq is the channel transfer matrix across the cut £1. Now consider the same cut, but 
increase the noise variance at all the received signals from 1 to 3. The mutual information 
reduces to log \I + HciHq/3\. Since the effect of increasing the noise variance is the same as 
that of reducing the signal power to 1/3, 

J(£n;y&|a&) = log|I + 7fo?4/3|. (101) 



Comparing (100) and (101), 



/(xWl4) > I(xgyg\xg)-3\n\. (102) 

Hence 

J(a&J&|a&) > I(x^yg\x^ c )-0(M). (103) 

In []2l, it is shown that the loss in choosing the inputs to be i.i.d. Gaussian £/V(0, 1) instead 
of the joint Gaussian distribution is O(M) bits. Therefore the choice of fractional inputs {xf } 
leads to a loss of at most O(M) bits in the mutual information when compared to the joint 
Gaussian inputs. ■ 

Appendix B 

Probability of the error event E x 

Lemma B.l: The probability of the events E or E\\ (y , 1) ^ T, for any e > 0, is bounded 
by 

Pt(Eq) + Pr(.Ea) < e, for N sufficiently large. (104) 
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Proof: The proof of this lemma involves repeated application of the AEP for strongly typical 
vectors (see [20]). 

Suppose the source transmits the codeword x (l). From the AEP of strongly typical vectors, 
the received signal y . for a node in layer 1 is jointly typical with x (l), with probability exceeding 
1 — ei, for a sufficiently large N, for any positive e 1 . Hence, with high probability, (y ,,w) G T 
for j G L\. Node i in L\ subsequently transmits a message vector x { = My-, with x t G Xi(l). 
Then, the received vector y . at a node in £ 2 satisfies (by AEP of strongly typical vectors) 

(Vj, {XiJieAfQj)) G Te, p , (105) 



with probability exceeding 1 — 62, for a sufficiently large N, for any positive e 2 . In ( |105| ), the joint 
typicality of the vectors in T e is with respect to the joint distribution p{{xi}iej\f(j)j Vj)> where 
{%i}ieAf(j) are independent and uniformly distributed, and v{y'j\{xi\ieN{j)) models the channel 
in the discrete network. Hence, with high probability, (y.,1) G T for j G £ 2 . Subsequently, 
each node in £ 2 transmits a message vector from Xj'(l)- 

We carry out this analysis across all the layers in the network, and obtain that the received 
signal in Ck satisfies (y ., 1) G T with probability exceeding 1 — e&, for a sufficiently large N, for 
any e& > 0. The error event E\ occurs if the received vector at the destination is not associated 
with message 1, and this will not occur (with high probability) if all the received vectors at the 
intermediate relay nodes are associated with 1. Hence, we get 

Pr(-Bi) < y^e fc , for a sufficiently large N, (106) 

k 

where the summation is over the number of layers. 

Similarly to the above arguments, we can prove that the transmit and received message vectors 
in the network are strongly typical, with probability exceeding 1 — e , for a sufficiently large N, 
for any eo > 0. Hence 

Pr(£ ) < eo- (107) 

Since {e^} are arbitrary positive numbers, for any e > 0, 

Pr(£ ) + Pr(£i) < e, for a sufficiently large N. (108) 
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Appendix C 
Bounding the size of the typical set 

Lemma C.l: The size of the set \xc k -±{ w )\ (with the notations as given in the proof of 



Lemma 



4.2) is 2°( M ^ N , where the size is independent of channel gains or SNR. 



Proof: Consider the set 3^v(w) of network- wide received vectors associated with the message 



w, defined in the proof of Lemma 4.2 For every y E yviw), we can define a network- wide 
transmitted vector, associated with the message w, as 

x v (l/ v ) := (x Ci (y Ci ),x C2 (y c J, . . . ,x Cl (]/ Cl )). (109) 

Define Xv{ w ) to consist of all such network- wide transmitted vectors associated with message 
w. 

Consider the relay nodes in C\ and let us bound the conditional entropy i?((y i ).j g £ 1 |xo): 

ieCi 

= ^2H{[h u x + Zi]\xo) (111) 

-ieCi 

ieCi 
where z- is the integer part of the Gaussian noiserland Cj n is the carry from adding the fractional 
parts of the signal and the noise. Since Zj is distributed as £A/"(0, 1), H(zj) is independent of 
channel gains or SNR and H{z 3 ) = 0(1). Also H( Cj ) = 0(1), hence H(y' Ci \x ) = |A|0(1). 
Let y&iw) be the projection of 3^v(^) ont o the nodes in L\. For a sufficiently large N, by 
the AEP of strongly typical vectors, the size of y^ (w) is 

\y Cl (w)\ = 2 NH{V > 0) = 2 N ^°W. (113) 

Since every set of transmit vectors in X£,i{ w ) is associated with a received vector in 3^1 (V), 
the size of x& ( w ) i s a l so bounded by 

\ XCl (w)\ < 2^l°«. (114) 



2 



Zj n is a complex number lying in Z + iZ corresponding to the integer portion of the real and imaginary parts of the complex 
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Now, similarly to (|110[H[TT2]), we can bound 



H(y' C2 \x Cl ) < J2 H (v'j\ x ^ ( 115 ) 

jec 2 

= \C 2 \0(l). (116) 

Let us fix a set of transmit vectors x Cl E XCi{w)', men consider a set of received vectors y 
for the nodes in £ 2 such that 

(Vj, {Xihe^O)) e %p> for a11 3 e £ 2 . (117) 

The distribution p is p({xi}ieA0> 2/j')' wnere {^«}«eA^(i) are independent and uniformly distributed, 
and p(yj|{xt}ieAT(j)) models the channel from the nodes in A/"(j) to j in the discrete network. 
For a fixed x £ , the size of the set of received vectors y c that satisfy the above is given by (for 
a sufficiently large N, by the joint AEP of strongly typical sequences) 

2 NH(y' C2 \x Cl ) < 2 ^£ 2 0(1)_ /^g^ 

Define yc u £ 2 {w) as me projection of 3^v( w ) onto the nodes in L\ and C 2 . Now every set of 
vectors in y£ lt £ 2 (w) can be obtained by first fixing a set of received vectors y in y^iw), 

— Ls 1 

determining the transmit vectors generated from it as x c (y ), and finding a set of received 
vectors y that are jointly typical with this set x c (y ), as in ( |117[ ). By counting the number 



of vectors across the layers, the size of yci,Ca{ w ) i s bounded by 

|^,£ 2 HI < (2^°( 1 ))(2 JV£2 °«) = 2 N ^ +C ^ 0( ~ 1 \ (119) 



With the bound, obtained similarly to ( |112| ), 



Hiy'jxc^) < \C k \0(l), (120) 

we extend this argument across all the layers in the network. The size of yv(vj) can t> e bounded 



as 



\yv(w)\ < 2 JV ^^ « = 2 N °( M \ (121) 

Since each network- wide transmit vector in Xv( w ) is a function of a vector in yy(w), 

|XvWI = 2 NO{M) - (122) 

Since xc k -i( w ) is the projection of the set Xv(w) onto the nodes in Ck-i, the size of xc k -i( w ) 
is bounded by the size of xv{w). Hence, the lemma is proved. ■ 
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